Improving Object Localization with Fitness NMS and Bounded IoU Loss

نویسندگان

  • Lachlan Tychsen-Smith
  • Lars Petersson
چکیده

We demonstrate that modern detection methods are designed to identify only a sufficently accurate bounding box, rather than the best available one. To address this issue we propose a simple and fast modification to the existing methods called Fitness NMS. This method is tested with the DeNet model and obtains a significantly improved MAP at greater localization accuracies without a loss in evaluation rate. We also derive a novel bounding box regression loss based on a set of IoU upper bounds and investigate simple RoI clustering schemes for improving evaluation rates for the DeNet wide model variants. Following these novelties we provide an analysis of input image dimensions with the DeNet model. We obtain a MAP[0.5:0.95] of 33.6%@79Hz and 41.8%@5Hz for MSCOCO and a Titan X (Maxwell). These results provide further evidence that DeNet and Fitness NMS methods can be scaled to provide highly competitive performance at a broad range of evaluation rates without requiring the manually specified bounding box anchors used in competing methods.

برای دانلود رایگان متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

TALL: Temporal Activity Localization via Language Query (Supplemental Material)

To directly compare our the temporal regression method with previous state-of-the-art methods on traditional action detection task, we did additional experiments on THUMOS-14. Since THUMOS is a classification task with limited number of action classes, we removed the cross-modal part and trained the localization network with classification loss (cross-entropy loss) and regression loss. We train...

متن کامل

Activity-Net 2017 Challenge Report

In this year’s competition, we use Temporal Context Network (TCN) for precise temporal localization of human activities. To improve performance for the metric used for evaluating proposals (i.e. area under the Average Recall vs. Average Number of Proposals per Video (AR-AN) curve with 100 proposals), we study the influence of two major hyper-parameters: the IOU threshold for NMS and the number ...

متن کامل

S-OHEM: Stratified Online Hard Example Mining for Object Detection

One of the major challenges in object detection is to propose detectors with highly accurate localization of objects. The online sampling of high-loss region proposals (hard examples) uses the multitask loss with equal weight settings across all loss types (e.g, classification and localization, rigid and non-rigid categories) and ignores the influence of different loss distributions throughout ...

متن کامل

Localization at prime ideals in bounded rings

In this paper we investigate the sufficiency criteria which guarantee the classical localization of a bounded ring at its prime ideals.

متن کامل

Truncated Linear Minimax Estimator of a Power of the Scale Parameter in a Lower- Bounded Parameter Space

 Minimax estimation problems with restricted parameter space reached increasing interest within the last two decades Some authors derived minimax and admissible estimators of bounded parameters under squared error loss and scale invariant squared error loss In some truncated estimation problems the most natural estimator to be considered is the truncated version of a classic...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

عنوان ژورنال:
  • CoRR

دوره abs/1711.00164  شماره 

صفحات  -

تاریخ انتشار 2017